3574 results found.
Written
Sentiment Analysis Tool,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons 4.0
Size:
13.5 KByte Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Expressively vulgar: The socio-dynamics of vulgarity and its effects on sentiment analysis in social media
-
Paper track:Computationally-aided linguistic analysis
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Isabel Cachola | University of Texas at Austin | US |
| Author 2 | Eric Holgate | University of Texas at Austin | US |
| Author 3 | Daniel Preoţiuc-Pietro | University of Pennsylvania | US |
| Author 4 | Junyi Jessy Li | University of Texas at Austin | US |
| Main Contact | Isabel Cachola | University of Texas at Austin | None |
Documentation:
Available on Github repository
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1855 sentences Production Status:
Existing-used
Use:
Emotion Recognition/Generation
-
Paper title:Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator
-
Paper track:Computationally-aided linguistic analysis
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Badri Narayana Patro | IIT Kanpur | IN |
| Author 2 | Vinod Kumar Kurmi | IIT Kanpur | IN |
| Author 3 | Sandeep Kumar | Indian Institute of Technology at Kanpur | IN |
| Author 4 | Vinay Namboodiri | IIT Kanpur | IN |
| Main Contact | Sandeep Kumar | Indian Institute of Technology at Kanpur | None |
Documentation:
yes, in English
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Non commercial use
Size:
18755 sentences Production Status:
Newly created-finished
Use:
Natural Language Generation
-
Paper title:Automatic Corpus Extension for Data-driven Natural Language Generation
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Elena Manishina | LIA | FR |
| Author 2 | Bassam Jabaian | CERI-LIA, University of Avignon | FR |
| Author 3 | Stéphane Huet | Université d'Avignon | FR |
| Author 4 | Fabrice Lefevre | Univ. Avignon | FR |
| Main Contact | Elena Manishina | LIA | None |
Documentation:
EnglishLanguage Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
n/a
Size:
see description minutes Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Claire Jaja | <Not Specified> | None |
| Author 2 | Douglas Briesch | <Not Specified> | None |
| Author 3 | Jamal Laoudi | <Not Specified> | None |
| Author 4 | Clare Voss | <Not Specified> | None |
| Main Contact | Claire Jaja | Army Research Lab (ARL), Advanced Resources Technologies Inc (ARTI) | US |
Documentation:
n/aLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1.2 GByte Production Status:
Newly created-in progress
Use:
Computational Social Sciences
-
Paper title:A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Vinodkumar Prabhakaran | Columbia University | US |
| Author 2 | Owen Rambow | Columbia University | US |
| Main Contact | Vinodkumar Prabhakaran | Stanford University | None |
Documentation:
Documentation will be provided with release, in EnglishLanguage Type:
Multilingual
Languages:
English french
Availability:
Not Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
-
Paper title:Annotation of specialized corpora using a comprehensive entity and relation scheme
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Louise Deléger | LIMSI-CNRS | FR |
| Author 2 | Anne-Laure Ligozat | LIMSI-CNRS & ENSIIE | FR |
| Author 3 | Cyril Grouin | LIMSI-CNRS | FR |
| Author 4 | Pierre Zweigenbaum | LIMSI-CNRS | FR |
| Author 5 | Aurélie Névéol | LIMSI-CNRS | FR |
| Main Contact | Louise Deléger | INRA | None |
Documentation:
<Not Specified>
Written
Terminology,
Language Type:
Multilingual
Languages:
English Portuguese
Availability:
Freely Available
License:
<Not Specified>
Size:
10000 entries Production Status:
Newly created-finished
Use:
<Not Specified>
-
Paper title:LexTec – a rich language resource for technical domains in Portuguese
-
Paper track:Terminology
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Palmira Marrafa | Centro de Linguística Universidade de Lisboa | PT |
| Author 2 | Raquel Amaro | Center of Linguistics of the University of Lisbon | PT |
| Author 3 | Sara Mendes | Centro de Linguística da Universidade de Lisboa | PT |
| Main Contact | Sara Mendes | Centro de Linguística da Universidade de Lisboa | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
Not Available
Size:
10 GByte Production Status:
Newly created-in progress
Use:
Text Mining
-
Paper title:Corpus for Customer Purchase Behavior Prediction in Social Media
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Shigeyuki Sakaki | Fuji Xerox | JP |
| Author 2 | Francine Chen | FX Palo Alto Laboratory | US |
| Author 3 | Mandy Korpusik | CSAIL Massachusetts Institute of Technology | US |
| Author 4 | Yan-Ying Chen | FX Palo Alto Laboratory | US |
| Main Contact | Shigeyuki Sakaki | Fuji Xerox | None |
Documentation:
No documentation.
Written
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
None
Size:
332054 tokens Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Liane Guillou | University of Edinburgh | GB |
| Author 2 | Christian Hardmeier | Uppsala universitet | SE |
| Author 3 | Aaron Smith | Uppsala universitet | SE |
| Author 4 | Jörg Tiedemann | Uppsala University | FI |
| Author 5 | Bonnie Webber | University of Edinburgh | GB |
| Main Contact | Liane Guillou | The University of Edinburgh | None |
Documentation:
There is documentation, it is publicly available and is in English.Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
17000 sentences Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Siamese CBOW: Optimizing Word Embeddings for Sentence Representations
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Monday
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tom Kenter | University of Amsterdam | NL |
| Author 2 | Alexey Borisov | University of Amsterdam, Yandex | NL |
| Author 3 | Maarten de Rijke | University of Amsterdam | NL |
| Main Contact | Tom Kenter | University of Amsterdam | None |
Documentation:
<Not Specified>




